Effect of Data Transformation on Residue

نویسندگان

  • Hyuk Cho
  • Inderjit S. Dhillon
چکیده

Recently, Aguilar-Ruiz [2005] considers a data matrix containing both scaling and shifting factors and shows that the mean squared residue [Cheng and Church, 2000], called RESIDUE(II) in this paper, is useful to discover shifting patterns, but not appropriate to find scaling patterns. This finding draws our attention on the weakness of RESIDUE(II) measure and the need of new approaches to discover both scaling and shifting patterns in the considered matrix. To resolve the weakness of RESIDUE(II) in finding scaling patterns, we propose a simple remedy that still uses the same residue measure. The main idea is to remove hidden scaling factors in the considered data matrix by taking a specific data transformation. We investigate various data transformations including no transformation, double centering, mean centering, standard deviation normalization, and Z-score transformation. Further, we apply these data transformations to row/column dimension of data matrix models with different global/local scaling and global/local shifting factors. First, we characterize the properties of the data transformations on different data matrix models, including six Euclidean co-clustering schemes in Bregman co-clustering algorithms [Banerjee et al., 2007] and other existing data models in the literature. In particular, we formally analyze the effect of each data transformation on the two residues [Cho et al., 2004], here called RESIDUE(I) and RESIDUE(II), respectively. Then, we apply all the data transformations to publicly available human cancer gene expression datasets and empirically validate the analysis results by using the minimum sum squared residue co-clustering (MSSRCC) algorithms [Cho et al., 2004]. In conclusion, through column standard deviation normalization or column Z-score transformation, we are able to overcome the shortcoming of RESIDUE(II) in finding scaling patterns and discover both scaling and shifting patterns.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigation of the Effect of Voiding Position on Uroflometric Parameters and Voiding Residue in Healthy Volunteers in Imam Reza Hospital, Mashhad, Iran

Background & Aims: Uroflowmetry is a common procedure to examine the lower urinary tract system. Uroflowmetry results are affected by different factors. In this study, the effect of voiding position on uroflowmetric parameters and voiding residue were investigated in healthy subjects. Methods: This descriptive–analysis study was performed on 41 healthy volunteers with mean age of 33.22 ± 9.45 r...

متن کامل

The Effect of Aspartate-Lysine-Isoleucine and Aspartate-Arginine-Tyrosine Mutations on the Expression and Activity of Vasopressin V2 Receptor Gene

Background: Vasopressin type 2 receptor (V2R) plays an important role in the water reabsorption in the kidney collecting ducts. V2R is a G protein coupled receptor (GPCR) and the triplet of amino acids aspartate-arginine-histidine (DRH) in this receptor might significantly influence its activity similar to other GPCR. However, the role of this motif has not been fully confirmed. Therefore, the ...

متن کامل

Evaluation of Tillage, Nitrogen Fertilizer and Crop Residue Management on some Agronomic Traits of Soybean

This study setout to investigate the effect of wheat residue, tillage, and nitrogen fertilizer management on some agronomic traits of soybean as a split split plot based on randomized complete block design with three replications. The main plots included wheat residue management: collecting and leaving residue and sub plot included tillage (without tillage and conventional tillage), and the sub...

متن کامل

The effect of quenching media and annealing temperature on graphitization transformation kinetic of CK100 tool steel

In this research, graphitization transformation of a commercial hypereutectoid steel called CK100 was studied by the dilatometric experiments at the range of 600 – 700 °C from prior martensitic structure. Also the effect of quenching media on the initial graphitization time and completion of transformation has been discussed. Also, graphitization transition from the different prior microstructu...

متن کامل

Effect of tillage and residue management on productivity of soybean and physico-chemical properties of soil in soybean–wheat cropping system

A microplot experiment was conducted in soybean–wheat cropping system at New Delhi during 2010-11 and 2011-12 to study the effect of continuous or cyclic tillage, viz., conventional tillage (CT) and zero-tillage (ZT) and residue management of either soybean (SR) and/or wheat (WR) on yield performance and soil physico-chemical properties. The experiment was laid out in randomized block desi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007